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ABSTRACT 


The  standard  Multiple  use  calibration  procedure  due  to  Scheffe  (1973) 
states  that  with  probability  1-6,  the  proportion  of  calculated  confidence 
intervals  containing  the  true  unknowns  is  at  least  1-a  in  the  long  run.  The 
probability  1-6  refers  to  the  probability  that  the  calibration  experiaent 
results  in  a  'good'  outcoae.  In  Scheffe's  foraulation  a  good  outcoae  involves 
both  coverage  of  the  true  underlying  regression  curve  and  an  upper  confidence 
liait  for  a  ,  the  scale  paraaeter.  Scheffe's  procedure  is  fairly  difficult  for 
practitioners  to  apply  because  it  relies  on  tables  that  are  not  easy  to  use.  A 
simpler  notion  of  'goodness'  which  only  requires  the  calibration  experiaent  to 
result  in  coverage  of  the  underlying  regression  leads  to  easily  calculated 
confidence  intervals  for  the  unknowns.  In  addition,  these  intervals  are 
generally  shorter  than  Scheffe's.  An  application  exaaple  is  given  to  illustrate 
the  technique. 

1.  Introduction 

A  calibration  curve  is  often  used  to  relate  instrument  readings  to 
established  standards  and  thereby  provide  meaningful  measurements.  Accounting 
for  the  resulting  uncertainty  from  calibration  is  of  central  importance.  A 
feature  of  critical  interest  is  how  often  the  calibration  curve  is  to  be  used. 
Most  often,  in  our  experience,  the  calibration  curve  will  be  used  many  times. 
Consequently,  the  measurements  will  be  correlated  and  this  dependence  must  be 
accounted  for.  Calibration  procedures  that  account  for  such  dependence  have 
been  developed  for  the  linear  aodel  by  Lieberaan,  Miller,  and  Hamilton  (1967) 


and  Scheffe  (1973).  In  this  article  we  present  a  new  aethod  for  coping  with 
multiple  use  calibration.  Our  aethod  is  based  on  a  modification  of  the  Scheffe 
(1973)  confidence  statements,  but  the  result  is  far  simpler  to  implement  and 
generally  leads  to  shorter  intervals.  Intervals  akin  to  ours  have  been  used  by 
scientists  and  engineers  for  some  time  without  theorectical  justification  for 
example,  see  Hockersaith  and  Ku  (1964). 

Consider  for  example  the  simple  linear  regression  model.  Assume  that  the 
responses  y  are  linearly  related  to  predictors  x  with  slope  p^  and 
Intercept  pQ,  and  that  the  responses  are  normally  distributed  with  standard 
deviation  a  .  Symbolically,  we  write  the  model  as 


(1.1) 


y  m  p0  *  Pj*  ♦ 

a  -  N(0, 1 ) . 


A  training  sample  (or  calibration  experiment)  of  size  N  is  used  to  construct 


the  classical  least  squares  estimates  (/^./s^.o)  of  (£Q.  pj.o)  and  then  the 
calibration  curve  pQ  *  p^x  is  formed  with  oS(x)  denoting  the  estimated 
standard  deviation  of  the  curve.  The  standard  deviation  of  pQ  *  p^x  is 
defined  to  be  oS(x),  the  estimated  standard  deviation  is  obtained  if  a  is 


replaced  by  o.  We  denote  the  observations  (y, . y„)  of  the  experiment  by  Y 

1  N 

and  use  to  denote  a  specific  outcome  of  Y. 


After  this  calibration  experiment  is  performed  and  the  calibration  curve  is 


constructed,  we  typically  observe  a  series  of  new  "responses"  y^y^,...  which 

•  * 

satisfy  (1.1).  The  associated  true  values  Xj,x2 .  are  not  observed  but  are 

*  * 

to  be  estimated  and  confidence  intervals  for  x^.x^,...  are  to  be  constructed. 


The  usual  estimate  Is 


Xj  *  (yj  -  fiQ)/Py  and  the  regaining  problem  concerns 
confidence  Intervals.  In  what  follows.  It  Is  convenient  to  write  the  true  curve 
as  f(x)  *  pQ  +  p^x  and  the  constructed  calibration  curve  as  f(x)  -  pQ  +  p^x. 
The  calibration  region  or  the  set  of  x's  of  interest  will  be  denoted  by  I. 

There  is  fairly  broad  agreement  on  what  to  do  in  the  case  that  a  single 
unknown  x*  is  to  be  estiaated  based  on  y*.  A  confidence  interval  for  x* 
can  be  obtained  as  follows;  For  given  x  ,  let  J(x)  be  a  (l-a)-lOO*  prediction 
Interval  for  the  response  y,  i.e., 

J(x)  -  f(x)  t  tN2(l-a/2)o(l+S2(x))1/2, 

where  t„  _ ( 1 -ot/2 )  is  the  l-a/2  percentile  of  the  Student  t-distribution  with 
N-2  degrees  of  freedom .  For  any  given  y,  let  K(y)  be  the  set  of  x  for 
which  y  is  "correctly"  predicted  in  the  sense  of  being  in  the  prediction 
interval,  i.e.,  K(y)  *  {xjy  «  J(x)}.  Wien  we  know  that  K(y*)  is  a  valid  l-o 
confidence  interval  for  x*.  In  syabols, 

Pr{x*sK(y*)}  -  Pr{y*tJ(x*)>  -  1  -a. 

Difficulties  arise  when  one  wishes  to  estimate  a  sequence  of  unknowns 

«  *  «  * 

Xj.Xg,...  from  observations  yj,y2,...  .  For  this  multiple  use  case,  Scheffe 
(1973)  suggests  aodlfylng  the  prediction  bands  to  be  of  the  fora 

Jg(x)  “  (y|f(x)  -  oCCj+CgSCx))  s  y  <f(x)  ♦  o(c1+c2s(x))> . 


(1.2) 


These  generally  lead  to  confidence  intervals  for  x  given  by  K (y  )  - 

J  S  J 

(x|y  eJ  (x) )} ;  rarely  in  practice  are  these  sets  disconnected.  The  constants 
J  * 

c  and  c  have  to  be  chosen  to  satisfy  uncertainty  criteria  required  of  the 
confidence  intervals. 

The  Scheffe  aethod,  as  does  ours,  involves  an  uncertainty  statement  that 
encoapasses  two  sources  of  error:  one  froa  the  calibration  experiaent,  the 
other  froa  the  post  calibration  aeasureaents.  The  data  froa  the  calibration 
experiaents  are  used  to  estiaate  f  and  a  and  one  aeasure  of  uncertainty  is 
the  probability  of  the  calibration  experiaent  producing  a  'good'  estiaate  of  f 
and  a.  Foraally,  Scheffe  defines  a  'good'  set  by 


(1.3)  Gg  -  (yj  |  f  (x)-f  (x)  J  1  c2oS(x)  all  xel  and  a/a  2:  b) 

where  b  and  c2  are  chosen  so  that  the  probability  that  the  experiment  is 
good  is  1-0  that  is, 

(1.4)  PA  G_]  t  1-0.  all  fi.a2 

p.o  S 

(here  ) .  The  connection  between  b,  c2  and  the  c^,  c2  of  (1.2)  and  o, 

O  is  spelled  out  in  Scheffe  (1973)  and  we  do  not  repeat  the  details.  The 
aeanlng  of  (1.3)  and  (1.4)  is  clear:  'good'  refers  to  f  covered  and  a 
bounded  and  the  probability  of  'good'  outcoaes  is  controlled  to  be  at  least  l-o. 

The  other  level  of  uncertainty  involves  the  probability  of  covering  the 
true  x*  when  a  new  observation  y*  is  aade.  Using  the  intervals  defined  at 
(1.2)  Scheffe  requires  that,  given  a  'good'  outcowe.  the  probability  of  covering 


the  true  x*  is  at  least  1-a.  In  foraal  teras 


(1.5)  P/j,o*>X*lKS(y*)  3  x*  IX-  )Ll  *  1-a 

2 

for  all  p,a  ,  all  x*4l,  all  y4Gg. 

If  (1.5)  holds  then  the  lone-run  proportion  of  confidence  intervals  coverine  the 

true  unknowns  (x*)  will  be  at  least  1-a  if  the  calibration  experiaent  resulted 

in  an  outcoae  in  G  .  The  choice  of  b  and  the  inclusion  of  a  bound  on  o  in 

the  definition  of  G$  is  to  allow  the  calculation  leading  to  (1.5). 

The  choice  of  c,  and  c0  that  assures  both  (1.4)  and  (1.5)  can  be  obtained 
X  z 

froa  tables  in  Scheffe's  paper.  However,  as  noted  in  Lechner,  Reeve  and 
Spiegelaan  (1982),  this  table  is  not  easy  for  practitioners  to  use.  The 
difficulty  is  the  need  to  coapute  the  ainiaua  and  aaxiaua  of  S(x)  over  the 
region  I  of  Interest  and  this  can  be  a  foraidable  task  in  linear  aodels  aore 
general  than  (1.1). 

Our  approach  is  first  define  the  'good'  set  not  by  G^  but,  by 

(1.6)  G  «■  (y.  |  |f(x)  -  f(x)|  S  CgOS(x) ,  all  X4l), 

where  Cg  is  to  be  chosen  to  satisfy  (1.8)  below.  This  restricts  attention  to 
whether  the  true  f  is  covered;  a  is  auxiliary  to  that  concern.  In  Section  2 
we  show  how  this  aodiflcatlon  leads  to  a  substantial  siapllf ication  in  the 
calculation  of  confidence  intervals  for  the  unknown  (Xj).  In  fact,  we  take  the 
Cj  and  c2  of  (1.2)  as 


-5- 


(ID 


C1  '  t»-2l,-a/2> 


c2  ‘  ‘2F2.H-a(10))1/2 


(F 2  l-®)  is  the  1-0  quantile  of  the  indicated  F  distribution). 

To  get  the  second  part  of  the  uncertainty  statement  note  that  the  choice  of 
c2  in  (1.7)  results  in 

(1.8)  0atY«.Gl  fc  1-8  for  all  fi.o 2. 


If  we  let 


Z  -  aax  | f (x)  -  f(x)|/S(x), 
x&I 


so  that  G  can  be  restated  as  (Z  S  CgO) ,  then  the  Intervals  defined  through 
(1.2)  with  Cj,  c2  taken  from  (1.7)  satisfy 


(19)  P.  _*[K(y*)  3  x*| Z  -  z.  G]  fc  1  -  a 

P»u  *  * 

2 

for  all  p,a  ,x*tl,  and  z.  Note  that  the  choice  c,,  c  of  (1.7),  or  any  choice 

permitting  (1.8)  and  (1.9)  to  hold,  will  differ  from  the  Cj ,  Cg  that  satisfy 
(1.4)  and  (l.S). 

In  our  experience  in  practical  case  we  get  narrower  intervals  by  our  method 
than  by  Scheffe's.  This  is  borne  out  in  the  example  in  Section  3  below.  Which 
is  more  appropriate,  G  or  ,  (1.5)  or  (1.9),  for  uncertainty  statements  is  not 
clear  on  prior  grounds.  Both  formalize  the  long  run  conduct  of  post¬ 
calibration  experiments  in  terms  of  the  outcomes  of  the  initial  calibration 
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experiment.  The  simplicity  of  (1  ."</  and  its  consequences  (1.8)  and  (1.9)  appear 
to  us  to  have  a  decided  advantage  -  practitioners  understand  the  role  of  the  P 


and  the  t  distributions  and  have  immediate  access  to  appropriate  tables,  the 
uncertainty  statements  of  (1.8)  and  (1.9)  have  little  added  conceptual 
complications,  and  there  are  no  extra  computational  difficulties  in  obtaining 
the  needed  Cj ,  Cg .  Indeed,  there  is  a  simple  graphical  explanation  in  Figure  1 
which  shows  how  the  confidence  band  on  f  and  the  error  in  y*  combine  to 
produce  the  desired  interval  for  x*.  In  Figure  1  the  (1-0)  confidence  band  is 
given  by  the  outer  curves;  on  the  y-axla  there  is  a  (l-o)  interval  with  y*  in 
the  center.  By  reflecting  the  endpoints  of  the  latter  interval  through  the 
confidence  band  we  get  two  points  that  surround  x*  and  define  the  confidence 
interval  for  x*.  For  these  reasons  of  simplicity  and  utility  we  are  compelled 
to  recommend  use  of  G  and  the  consequent  easy  and  quick  use  of  (1.7). 

The  choices  of  Cj  and  Cg  can  be  improved  at  the  expense  of  added 

analysis  and  computation  (see  Remarks  1  and  2  of  Section  2).  If  the  model  (1.1) 

is  replaced  by  a  multiple  regression  with  p  variables  including  the  intercept, 

1/2 

then  Cj  -  tN_p( l-a/2)  and  Cg  -  (pFp  N_pU-0))  . 


2.  Derivation  of  Results 

Assume  f  follows  a  multiple  regression  model  with  p  parameters.  As 
noted  in  (1.6)  of  Section  1  we  regard  a  good  outcome  of  the  calibration 
experiment  as  one  that  is  in  G.  Our  first  step  is  to  call  on  standard  least 
squares  theory  and  note  that  the  requirement 
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P[G]  -  P[|(f(x)  -  f (x)/S(x) |  S  c 2o  all  x*l]  i  1-0 

1/2 

is  fulfilled  if  c_  *  (pF_  u  (1-0))  (this  can  be  seen  in  Scheffe,  1973). 

<  p.w-p 

Ideally  we  would  like  the  saallest  possible  but  the  present  choice  is 

siaple  and  adequate  for  now.  Me  note  that  Scheffe  uses  a  multiple  y  of  this 
c_  for  soae  .95  £  y  2  1.2  whose  choice  depends  on  S(x),  p,  N. 

Recall  from  Section  1  that  G  can  be  expressed  as  (y|Z  5  c^o)  where  Z 
defined  following  (1.8).  The  Independence  of  f  and  o,  which  cones  fron  the 
assumption  of  normality,  implies  that  Z  and  a  are  independent  random 
variables.  This  independence  and  straightforward  calculation  produces 

(2.2)  P[Z  S  c2d.  olq  |  Z>z] 

A  A 

-  P[z  <  c2o,  a  £  q] 

■  min  [p[o  i.  z/Cg]  •  P[o  i-  ql] 

A  m 

£  P(o  i  z/c2l  Pfo  t  q] 

m  A 

-  P[Z  <  c2o  I  Z  -  z]  P[o  £  q]. 

By  dividing  the  two  ends  of  (2.2)  we  get 

(2.3)  P[o  2  q  |  Z  i  c  o,  Z  -  z]  2:  P[o  2:  q]. 

Let  A  •  (Z  •  z,  Z  i  c.o).  Then 
z  c 

P[K(y)  J  x  |  AJ  ■  P[f (x)-f (x)-c2oS(x)-oc1  S  o*  S  oCj+f (x)-f (x)+c2oS(x) | A^] 

(2.4)  t  Pt-oCj  <  oe  S  oCj |  Az]  -  )(<Mc1t)-4(-c1t))dH(t) 


is 
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where  H  =  the  conditional  distribution  of  a/a  given  A  .  The  integrand  in 

z 

(2.4)  is  clearly  increasing  in  t  and  we  have  just  shown  in  (2.3)  that 

H(t)  <  P [a/a  <  t] . 

Hence  the  right  side  of  (2.4)  is  at  least  as  large  as 

J($(Clt)  -  «(-c1t))d  P[o/o  <  t]  -  P[|tN_p|  S  Cj]. 

Choose  c,  ■  t„  (l-a/2)  and  obtain  the  result:  given  Z,  G  the  probability  of 
1  N-p 

covering  the  unknown  x  is  at  least  1-a. 

1/2 

Remark  1 .  The  use  of  c2  -  (pFp  ^  p(0))  is  only  made  for  simplicity. 

It  is  known  (Wynn  and  Bloomfield.  1971;  Knafl,  Sacks,  and  Ylvisaker,  1985)  that 
better  choices  of  c2  can  be  made  to  provide  statements  like  (1.8).  An 
immediate  consequence  of  the  arguments  here  is  that  such  a  c2  combined  with 
Cj  -  tNp( l-a/2)  also  provides  useable  confidence  bounds  and  narrower  ones  than 
described  above. 

Remark  2.  Examination  of  (2.4),  especially  going  from  the  second  to  the 
third  expression,  reveals  that  improvement  can  be  obtained  by  using  the 
monotonicity  properties  of  the  t  distribution  to  replace  the  third  term  by 

Pf-2oS(x)c2-c1o:ofc1<c1o|y1 . 

This  leads  to  choosing  Cj  to  satisfy 
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(2.5) 


P[-c1-2c2S(x)<tNp<c1]  -  1-a. 

Finding  Cj  to  satisfy  (2.5)  requires  knowledge  of  S(x).  Since  the  x's  are 

unknown,  this  is  unrealistic  but  we  can  replace  S(x)  by  ain  S(x)  -  S  (say) 

x  0 

and  then  use  t-tables  to  find  c^  so  that 

P[-c1-2c2SQ<tN_p<c1]  -  1-a. 

Unless  «  0,  such  a  c,  is  smaller  than  tu  (l-a/2).  A  less  precise  but 

O  1  W“p 

practical  approach  is  to  replace  S(x)  by  S(x)  where  x  »  f  1(y)  and  treat 
S(x)  in  (2.5)  as  if  it  were  non-randoa.  This  gives  data-dependent  c.  but 
leads  to  noalnal  values  of  confidence  that  should  be  close  to  correct. 

Remark  2  is  a  relatively  siaple  computation  and  its  implementation  can  lead 
to  substantial  savings  in  the  lengths  of  intervals.  This  is  clearly  Indicated 
in  the  example  presented  in  Section  3,  see  Table  2.  We  do  not  over-emphasize 
Remarks  1  and  2  because  our  concern  is  to  make  clear  the  advantage  of  the  quick 
and  easy  use  of  (1.7).  Our  recommendation  overall  would  be  to  employ  Remark  2 
as  well  because  it  is  relatively  easy  to  carry  out.  Implementing  Remark  1  is 
unlikely  to  yield  great  benefits  unless  the  degree  p  is  bigger  than  1  or  the 
problem  is  one  where  nonparametrlc  calibration  (see  Remark  4  below)  is  needed. 

In  such  cases  it  would  be  necessary  to  incorporate  substantial  computations  (see 
Knafl,  Sacks,  Spiegelman,  and  Ylvlsaker,  1984;  Knafl,  Sacks,  and  Ylvisaker, 

1985)  and  would  belle  our  use  of  the  words  "quick  and, easy." 

Remark  3.  It  is  straightforward  to  adjust  the  statements  and  procedure  if 
the  calibration  experiment  is  performed  in  one  laboratory  but  the  calibrations 
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are  carried  on  elsewhere.  This  situation  leads  possibly  to  two  o's,  one  in  the 
calibration  experiaent,  the  other  froa  new  observations  y*.  Under  usual 
assuaptions  of  Independence,  the  aodifications  needed  are  easy  to  aake. 

Reaark  4.  (  Nonparaaetric  calibration.)  Knafl,  et  al.  (1984)  discuss 
calibration  when  the  aean  of  the  responses  f(x)  does  not  follow  a  strict  linear 
aodel  so  that  the  use  of  a  aultiple  regression  as  done  above  would  be 
inappropriate.  Exaainatlon  of  the  details  of  arguaents  used  in  that  paper  shows 
that  considerable  siaplif icatlon  in  computation  can  result  by  use  of  the  G  of 
(1.6).  He  forego  elaboration  of  the  necessary  technicalities. 

3.  Exaaple 

A  simple  exaaple  arises  in  atoaic  absorption  spectroscopy  for  which  there 
are  twelve  data  points  as  shown  in  Table  1.  Figure  2a  is  a  plot  of  the  data  of 
Table  1.  Figure  2b  is  a  plot  of  the  residuals  froa  a  straight  line  fit.  These 
figures  indicate  that  a  straight-line  fit  to  the  data  is  entirely  reasonable. 

He  chose  a  ■  0.10  and  0  -  0.003.  Following  Scheffe  we  would  get  calibrations 

based  on  a  ■  0.003,  c  »  2.32  and  c_  »  2.36,  see  the  discussion  after  equation 

X  » 

(1.6).  As  an  exercise,  the  reader  nay  wish  to  atteapt  the  calculation  of  the 

Scheffe  constants  and  coapare  the  effort  involved  with  that  required  to  look  up 

1/2 

our  Cj  *  t10(0.95)  *  1.81  and  Cg  -  (2Fg  1Q(0.90))  -  2.41.  The  calibration 

intervals  are  given  in  Table  2  for  selected  values  of  y;  these  intervals 

provide  nuaerlcal  evidence  of  our  claia  that  the  Scheffe  intervals  are  typically 
significantly  longer.  The  difference  in  lengths  is  due  to  the  difference 
between  G  and  Gg  in  (1.6)  and  (1.3). 
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Table  1 


Ato*lc  Absorption  Spectroscopy 


x 


0.0 

0.0 

0.0 

0.0 

.050 

.050 

.100 

.100 

.200 

.200 

.500 

.500 


_ y_ 

.045 

.047 

.051 

.054 

.084 

.087 

.115 

.116 

.183 

.191 

.395 

.399 


x  -  amount  of  copper  In  a  dilute 
acid  solution  measured  In 
mlcrograas/mllllllter. 

y  -  Instrument  response  In 
absorbance  units. 


(The  data  have  been  rounded  to  three  decimal  places.) 


Table  2 

Calibration  Intervals:  Scheff /Compared  with  Section  1 
y  •  new  observation 

\  ■  lower  endpoint  of  calibration  Interval 

Xy  ■  upper  endpoint 

A  ■  length  of  calibration  Interval 

First  row  In  each  entry  Is  Schefff  calibration;  second 
row  Is  calibration  according  to  section  2;  third  row 
uses  remark  2;  fourth  row  uses  remarks  1  and  2. 


i _ i 


.06 

.0011 

.0313 

.0302 

Scheffe/ 

.0034 

.0290 

.0256 

Section  2 

• 

.0045 

.0279 

.0233 

Remark  2 

.0051 

.0273 

.0222 

Remarks  182 

.10 

.0592 

.0883 

.0292 

.0615 

.0860 

.0245 

.0626 

.0849 

.0223 

.0631 

.0845 

.0214 

.15 

.1314 

.1601 

.0287 

.1337 

.1578 

.0240 

.1348 

.1566 

.0218 

.1353 

.1562 

.0209 

.22 

.2316 

.2614 

.0298 

.2340 

.2591 

.0251 

.2351 

.2579 

.0229 

.2356 

.2574 

.0219 

.30 

.3453 

.3780 

.0327 

.3476 

.3757 

.0281 

.3487 

.3745 

.0258 

.3494 

.3736 

.0244 

.38 

.4586 

.4949 

.0363 

.4608 

.4926 

.0318 

.4619 

.4914 

.0295 

.4629 

.4904 

.0275 
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i  DILUTE  ACID  SOLUTION:  PLOT  OF  DATA 
FIGURE 


DILUTE  ACID  SOLUTIOND:  RESIDUAL  PLOT 
FIGURE  2* 


